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In this paper we consider the transfer of records from a manual 
file system (MFS) to a mechanized data base management system 
when the conversion takes place through two processes. In the going- 
forward process a transfer is made by the regular work force when 
an order is received to change or delete a record in the MFS. In 
addition, the transfer of records is carried out by a crash force, 
working at a fixed rate. We investigate the expected number of 
records remaining in the MFS at future times, and the expected 
number of records removed from the MFS during the corresponding 
periods by the going- forward work force and by the crash force. We 
also derive the expected time taken to go from a prescribed number 
of records in the MFS to a smaller number. We give simple approxi- 
mations for all of these quantities. The results have been used 
elsewhere in the construction of an economic model used to estimate 
cost/benefits and labor force levels during the mechanization of 
transaction data bases. The numerical results presented graphically 
are typical of two Bell System applications. 



I. INTRODUCTION 

In this paper, we analyze the generic problem of implementing a 
system that requires manual effort for converting paper records in a 
manual file system (MFS) to mechanized computer records in a data 
base management system (DBMS). We can achieve conversion to a 
mechanized environment in a number of ways: by going-forward on an 
activity basis, by crash-conversion utilizing an augmented work force, 
or by some mix of the two dictated by available resources such as 
money, people, space, and time objectives. 

A paper record is mechanized using the going-forward approach if 
the record is transferred from the MFS to the DBMS by regular record 
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maintenance employees when a change or deletion activity is per- 
formed upon that record. If a randomly chosen record is transferred 
by a separate temporary work force, the record is said to be transferred 
using the crash approach. The paper record could contain all of the 
information that is intended to be mechanized or an enhancement to 
an already existing mechanized record. 

The arrival times of activities (change or deletion orders) for a 
particular record are assumed to form a Poisson process. We further 
assume that the distributions of the activity arrival times for every 
record are independent and identical. Hence, the sequence of times at 
which these change and deletion activities occur forms a nonstationary 
Poisson process. In addition, the transfer of records is carried out by 
a special task force with constant total mean rate, the amount of work 
required for a record to be transferred being exponentially distributed. 
Hence, there are two different stochastic processes that are concur- 
rently operating against the MFS. Each record in the MFS is trans- 
ferred if an activity occurs upon it, while at the same time each record 
in the MFS has a chance of randomly being selected by a crash person 
when he/she completes a conversion. 

When an activity occurs against a record in the MFS, it will be 
assumed that a going-forward person will immediately attend to trans- 
ferring and processing the record. Therefore, if substantially more than 
the average number of records concurrently experience activity, it will 
be assumed that management will borrow going-forward people from 
other areas or have the normal going-forward force work overtime. 
However, the time it takes for a crash-conversion person to perform 
the conversion is pertinent since the force is assumed to be fixed. In 
this case, the average number of records that the entire crash conver- 
sion force can convert in a specified period determines the mean of the 
stochastic process describing the crash effort. 

In this paper we highlight the derivations of formulas that are used 
in the construction of a model 1 used to estimate cost/benefits and 
labor force levels during the mechanization of transaction data bases. 
In the next section, we investigate the expected number of records 
remaining in the MFS, removed by the going-forward people, and 
removed by the crash force people. In particular, we point out a simple 
approximation to the expected number of records remaining in the 
MFS, which is valid over a significant part of the range of interest. We 
derive the expected passage time from I to m records in the MFS (/ 
> m) in Section III, and obtain a simple asymptotic approximation for 
the expected time at which the MFS becomes empty. Finally, in 
Section IV, we present some conclusions, and discuss the relevance of 
the results to Bell System applications. 
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II. EXPECTED NUMBER OF RECORDS 

We will now consider a stochastic model for the transfer of records 
from a MFS to a DBMS. We assume that there are n crash force 
people, and that the conversion times of the records are independent 
and exponentially distributed with mean rate p. At time t = 0- there 
are S records in the MFS. At time t = there are n of these records 
removed from the MFS by the crash force. When the conversion of a 
record is completed, that record is instantaneously transferred to the 
DBMS, and another record is removed from the MFS by the crash 
person involved. A record is also removed from the MFS, by a going- 
forward person, when a change or deletion order is received; that 
record is transferred to the DBMS at some later time. We assume that 
the sequence of times at which change and deletion orders are received 
forms a nonstationary Poisson process with rate iju where i is the 
number of records in the MFS. (The mean deletion and change order 
rates are summed to get /x, since it is a well-known fact that if two 
independent Poisson processes are affecting the records, then the 
composite of these two processes can be described by a Poisson process 
where the parameters are summed.) If a change or deletion order is 
received for a record that is in the process of being converted by a 
crash person, the change or deletion order is not executed until after 
the record has been transferred to the DBMS. 

The number of records in the MFS at time r = 0+ is 

M = S-n. (1) 

For convenience, we let 

np = vfi. (2) 

This is the mean rate at which the crash force converts records. We 
also let P,(t) denote the probability that there are i records in the 
MFS at time t. But, the number of records in the MFS is a pure death 
process, 2 with death rate 

IU = np(l - 8 i0 ) + ill - Ml>(l - &o) + »1 i = 0,---,M, (3) 

where 5„ denotes the Kronecker delta; S„ = 1 if t =j, and S„ = if i ¥>}. 
Hence, 

— = -«[ r (i - S„>) + i]P, + n(l - SiuHv + i + DP*«, (4) 

dr 

for t > and i - 0, • • • , M . The initial condition at time t = 0+ is 

Pi(0+) = 8 lM , i = 0,---,M. (5) 

The expected number of records in the MFS at time t is 
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N(r) = I iP t (r). (6) 

It follows from (4) through (6) that 

^- + t iN= -^[1 - Po(r)]; N(0+) = M. (7) 

cLt 

It is straightforward to calculate the Laplace transforms of P,, i = 
0, • • • , M, from (4) and (5), and thence that of N from (7). By inverting 
the latter transform, Morrison 3 obtained an explicit expression for 
N(r) in terms of an incomplete beta function. 4 Asymptotic approxi- 
mations to N(t), involving the complementary error function, 4 were 
derived when M :» 1 and v » 1. 

Since < P (t) < 1, it follows from (7) that 

L(t) = (M + v)e~ ilT -v< N(t) < Me"" 7 . (8) 

The lower bound is, of course, superfluous if L(t) < 0. When M » 1 
and v <sc M, these bounds yield an accurate approximation to N(t) as 
long as Me _/iT :» v. An upper bound for N(t), which is valid only for 
L(t) > 0, and exact for L(t) = 0, was derived from the exact result. 3 It 
was shown that 

< N(t) - L(t) < ll™*^ e-^d - e~n M for L(r) > 0. (9) 

From (9) we deduce that 

AT(t)=L(t) for M»l, L(t) »min(v/M, v^)^ — , (10) 

yJvT{v) 

where, from the properties of the gamma function, 4 

v"e~ v v'e~'' 1 

— — <v for ^>0; — — for r»l. (11) 

r(") s/vT(v) V2^ 

Hence, (10) gives a larger range of validity for the approximation 
JV(t) ~ L(t) than does (8), and significantly so if v :» 1, as it is in cases 
of interest. We remark that intuitively we expect Po(t), the probability 
that there are no records in the MFS at time t, to be extremely small 
for a significant time when M :» 1, and in that time interval we deduce 
from (7) that N(t) ~ L(r). The intuitive derivation, however, does not 
give the range of validity of the approximation. 

Typical numerical results are illustrated in Figs. 1 and 2 for the case 
S = 10 5 , n - 10, p - 0.001608 per day, and v = 18657. Only N(t) and 
the lower bound L{r) axe shown in Fig. 1. The upper bound given by 
(9) is also shown in Fig. 2. This figure depicts the tail region, in which 
the expected number of records in the MFS is considerably fewer than 
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Fig. 1 — Expected number of records in the MFS, and lower bound, as a function of 
time, for S = 10\ n - 10, /i = 0.001608 per day, and v = 18657. 



the initial number. It is seen that N(t) is still quite close to L(t) for 
values of L(t) just a few times greater thanmin(vM, \M/v27r ~ 54.5. 
Another quantity of interest is the expected number of records F(t) 
removed from the MFS by the going-forward people in the time 
interval (0, t]. Let P«(t) denote the probability that there are i records 
in the MFS at time t, and that k records have been removed from the 
MFS by the going-forward people in (0, t]. Then 
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M-\ M-i 

F(t) =E E kPikir). 



(12) 



But, 



dPu. 
dr 



= —fl[p(l - 5,o) + i]Pik + ILV{\ - Sk,M-i)Pi+l,k 

+ [i{l - 8ko){i+ l)Pi+i,*-i, 



(13) 
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Fig. 2 — Expected number of records in the MFS, and upper and lower bounds, as a 
function of time, for S = 10 5 , n = 10, n = 0.001608 per day, and v = 18657. 
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for t > 0, k = 0, ■■■ ,M- i, and i = 0, • • • , M. The initial condition at 
time t = 0+ is 

Pik(0+) = 8iM, k - 0, • • • , M - i, and i = 0, . . . , M. (14) 

We note that i = M implies that k = 0. Also, 

M-i 

P.-(t) = £ fiiM, (15) 



and if we sum (13) and (14) from k = to M - i we obtain (4) and (5). 
It follows directly from (12), (13), and (15) that 

^- = M I iPM = fiN(r). (16) 

This implies that the rate of change of the expected number of records 
removed from the MFS by the going-forward people is equal to the 
expected value at time r of the rate at which change and deletion 
orders are received. If we let C(t) denote the expected number of 
records removed from the MFS by the crash people in (0, t], then 

C(t) = M - N(t) - F(t). (17) 

From (7), (16), and (17), we obtain 

dC M 

^ = iiv[l - Po(t)] - tiv 2 (1 - 5,o)P,(r), (18) 

dr ( =o 

which has an interpretation analogous to that of (16). We note that 
dC/dr » jip in the region where P (t) is small. Also, corresponding to 
the approximation N(r) a L(t), where L(t) is defined in (8), it follows 
from (16) that 

F(t) » (M+ v){l - e-" T ) - prr, (19) 

since F(0+) = 0. 

III. EXPECTED PASSAGE TIMES 

We next turn to the calculation of the expected passage time from 
I torn records in the MFS. Now, 5 for a pure death process, with death 
rate (a, the passage time t,.,-i from state i to state i — 1 is exponentially 
distributed with density /x, exp(-ju,T). Hence, in particular, the expected 
value of T,,,-i is 

£t,.,-i=-. (20) 

The passage time from I to m records in the MFS is 



/ 



1=771 + 1 



Tl.m = £ T,,,-l, (21) 

1 
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and the random variables in the sum are independent. From (3), (20), 
and (21), we obtain 

' 1 

fiEr,, m = £ ——- = W+p+l)-Mm + v+ 1), (22) 

i= m +\ \v + i) 

where yp denotes the logarithmic derivative of the gamma function. 4 
We will make use of the asymptotic result 

+(x) = log x + O (-) for x » 1. (23) 

Hence, from (22), the expected time at which the MFS becomes empty 
is given by 

hEtmjd = \I>(M + v + 1) - ${v + 1) 

, (M+v\ 
~ log for v » 1. (24) 



Also, the expected time starting from / records until the MFS becomes 
empty is given by 

lxEr, fl - $(1 + v + 1) - \p(v + 1) ~ log ( ^j for v » 1 

for v » 1, I <K v. (25) 

v 

Next, the expected time at which the number of records in the MFS 
first reaches m is given by 

llETM.n, = yp{M + V + 1) - \J/(m + V + 1) 

, (M+v\ 

-log — for j<»1. (26) 

\m + v ) 

But, from (8) to (10), the time r m at which the expected number of 
records in the MFS is equal to m satisfies 

, (M + p" 

fiT m ~ log - 

\m + v t 

for Af»l, p»l, m»min(v / M, y/v)/-j2v. (27) 

Hence, under the restrictions in (27), we have Etm,™ ~ r m . 

We now consider the final stage of the conversion by the crash force, 
starting from the time tmjd when the MFS becomes empty. Each of the 
n crash people is then busy with a record. We first assume that a 
person is removed from the crash force when he/she completes the 
conversion of the record on which he/she has been working. If j is the 
number of records remaining to be converted by the crash force, we 
then have a pure death process with /x, =jp,j = 0, • • • , n. If t c denotes 
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the time at which the conversion by the crash force is completed, then, 
with the help of (2), 

[lEirc - tm.o) =m£- = -£*=- W» + 1) " +0)1 < 28 > 

At the other extreme, we assume that the entire crash force works 
jointly on the remaining records, until the conversion of the last record 
is completed. This will give a lower bound on the expected time to 
complete the conversion of the final n records, since the crash force 
continues to work at its maximum rate. In this case the death rate is 
fa = np = vfi, j = I, ••• , n. If t c denotes the corresponding time at 
which the conversion by the crash force is completed, then 

HE(t c -tm,o)=H 2 - = -• ( 29 > 

/-i ft v 

IV. CONCLUSION 

In this paper we presented formulas that are employed in an eco- 
nomic model 1 used to estimate cost/benefits and labor force levels 
associated with the mechanization of transaction data bases. Because 
the formulas can be computed very rapidly on any modern computer, 
the model can be used as an interactive managerial decision tool to 
perform what-if studies in real time. Numerical results presented 
graphically are typical of two applications, namely, (i) the conversion 
of manual records in a service order processing operation of a Bell 
Operating Company business office, and (ii) the mechanization of 
equipment inventory records during the implementation of Trunks 
Integrated Record Keeping System." We do not presume that the 
stochastic model is appropriate for all possible data base conversions, 
but evidence from field trials indicates that it does adequately model 
a certain class of Bell System mechanization problems. 
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